Data preprocessing

en

WikiRank.net
ver. 1.6.2

Data preprocessing

Quality:

Data pre-processing - manipulation of data before it is analyzed. Article “Data preprocessing” in English Wikipedia has 22.9 points for quality (as of July 1, 2025). The article contains 13 references and 5 sections. The article also contains templates indicating quality issues, therefore its score was reduced by 2.55 points.

This article has the best quality in Arabic Wikipedia. However, this article is the most popular in English version.

Since the creation of article “Data preprocessing”, its content was written by 56 registered users of English Wikipedia and edited by 108 registered Wikipedia users in all languages.

The article is cited 116 times in English Wikipedia and cited 250 times in all languages.

The highest Authors Interest rank from 2001:

  • Local (English): #68944 in July 2022
  • Global: #94454 in June 2025

The highest popularity rank from 2008:

  • Local (English): #166732 in November 2021
  • Global: #281492 in November 2021

There are 13 language versions for this article in the WikiRank database (of the considered 55 Wikipedia language editions).

The quality and popularity assessment was based on Wikipédia dumps from July 1, 2025 (including revision history and pageviews for previous years).

The table below shows the language versions of the article with the highest quality.

Languages with the highest quality

#LanguageQuality gradeQuality score
1Arabic (ar)
معالجة مسبقة للبيانات
30.7716
2German (de)
Datenvorverarbeitung
24.3343
3Estonian (et)
Andmete eeltöötlemine
24.1248
4Hebrew (he)
עיבוד נתונים מקדים
23.6598
5Catalan (ca)
Preprocessament de dades
23.1432
6English (en)
Data preprocessing
22.9406
7Korean (ko)
데이터 전처리
15.839
8Russian (ru)
Предварительная обработка данных
15.8139
9Japanese (ja)
データ前処理
13.1821
10Persian (fa)
پیش-پردازش داده
11.5541
More...

The following table shows the most popular language versions of the article.

Most popular in all the time

The most popular language versions of the article "Data preprocessing" in all the time
#LanguagePopularity awardRelative popularity
1English (en)
Data preprocessing
617 154
2Portuguese (pt)
Pré-processamento de dados
22 527
3Russian (ru)
Предварительная обработка данных
19 961
4Japanese (ja)
データ前処理
12 535
5Arabic (ar)
معالجة مسبقة للبيانات
6 204
6Ukrainian (uk)
Попередня обробка даних
5 645
7Korean (ko)
데이터 전처리
3 601
8Malay (ms)
Prapemprosesan data
1 499
9Persian (fa)
پیش-پردازش داده
1 124
10Hebrew (he)
עיבוד נתונים מקדים
1 107
More...

The following table shows the language versions of the article with the highest popularity in the last month.

Most popular in June 2025

The most popular language versions of the article "Data preprocessing" in June 2025
#LanguagePopularity awardRelative popularity
1English (en)
Data preprocessing
1 509
2Korean (ko)
데이터 전처리
233
3Russian (ru)
Предварительная обработка данных
108
4Arabic (ar)
معالجة مسبقة للبيانات
69
5Japanese (ja)
データ前処理
53
6Portuguese (pt)
Pré-processamento de dados
47
7Ukrainian (uk)
Попередня обробка даних
38
8German (de)
Datenvorverarbeitung
23
9Malay (ms)
Prapemprosesan data
12
10Hebrew (he)
עיבוד נתונים מקדים
11
More...

The following table shows the language versions of the article with the highest Authors’ Interest.

The highest AI

Language versions of the article "Data preprocessing" with the highest Authors Interest (number of authors). Only registered Wikipedia users were taken into account.
#LanguageAI awardRelative AI
1English (en)
Data preprocessing
56
2Hebrew (he)
עיבוד נתונים מקדים
9
3Portuguese (pt)
Pré-processamento de dados
8
4Japanese (ja)
データ前処理
6
5Arabic (ar)
معالجة مسبقة للبيانات
5
6Estonian (et)
Andmete eeltöötlemine
5
7Russian (ru)
Предварительная обработка данных
5
8German (de)
Datenvorverarbeitung
4
9Persian (fa)
پیش-پردازش داده
4
10Malay (ms)
Prapemprosesan data
2
More...

The following table shows the language versions of the article with the highest Authors’ Interest in the last month.

The highest AI in June 2025

Language versions of the article "Data preprocessing" with the highest AI in June 2025
#LanguageAI awardRelative AI
1German (de)
Datenvorverarbeitung
4
2Arabic (ar)
معالجة مسبقة للبيانات
2
3Catalan (ca)
Preprocessament de dades
0
4English (en)
Data preprocessing
0
5Estonian (et)
Andmete eeltöötlemine
0
6Persian (fa)
پیش-پردازش داده
0
7Hebrew (he)
עיבוד נתונים מקדים
0
8Japanese (ja)
データ前処理
0
9Korean (ko)
데이터 전처리
0
10Malay (ms)
Prapemprosesan data
0
More...

The following table shows the language versions of the article with the highest number of citations.

The highest CI

Language versions of the article "Data preprocessing" with the highest Citation Index (CI)
#LanguageCI awardRelative CI
1English (en)
Data preprocessing
116
2Arabic (ar)
معالجة مسبقة للبيانات
47
3Korean (ko)
데이터 전처리
30
4Japanese (ja)
データ前処理
20
5Russian (ru)
Предварительная обработка данных
14
6Ukrainian (uk)
Попередня обробка даних
7
7Portuguese (pt)
Pré-processamento de dados
6
8Catalan (ca)
Preprocessament de dades
3
9Persian (fa)
پیش-پردازش داده
2
10Hebrew (he)
עיבוד נתונים מקדים
2
More...

Scores

Estimated value for Wikipedia:
English:
Global:
Popularity in June 2025:
English:
Global:
Popularity in all years:
English:
Global:
Authors in June 2025:
English:
Global:
Registered authors in all years:
English:
Global:
Citations:
English:
Global:

Quality measures

Interwikis

#LanguageValue
arArabic
معالجة مسبقة للبيانات
caCatalan
Preprocessament de dades
deGerman
Datenvorverarbeitung
enEnglish
Data preprocessing
etEstonian
Andmete eeltöötlemine
faPersian
پیش-پردازش داده
heHebrew
עיבוד נתונים מקדים
jaJapanese
データ前処理
koKorean
데이터 전처리
msMalay
Prapemprosesan data
ptPortuguese
Pré-processamento de dados
ruRussian
Предварительная обработка данных
ukUkrainian
Попередня обробка даних

Popularity rank trends

Best Rank English:
#166732
11.2021
Global:
#281492
11.2021

AI rank trends

Best Rank English:
#68944
07.2022
Global:
#94454
06.2025

Languages comparison

Important global interconnections (July 2024 – June 2025)

Wikipedia readers most often find their way to information on Data pre-processing from Wikipedia articles about Preprocessing, Principal component analysis, Data mining, Feature scaling and Data science. Whereas reading the article about Data pre-processing people most often go to Wikipedia articles on Data mining, Data cleansing, Canonical form, Feature engineering and One-hot.

Cumulative results of quality and popularity of the Wikipedia article

List of Wikipedia articles in different languages (starting with the most popular):

News from 10 February 2026

On 10 February 2026 in multilingual Wikipedia, Internet users most often read articles on the following topics: Jeffrey Epstein, Ghislaine Maxwell, Ilia Malinin, Bad Bunny, 2026 Winter Olympics, Epstein files, Jutta Leerdam, Jake Paul, Puerto Rico, Little Saint James.

In English Wikipedia the most popular articles on that day were: Bad Bunny, Jeffrey Epstein, Epstein files, Ilia Malinin, Savannah Guthrie, Ghislaine Maxwell, 2026 Winter Olympics, List of Super Bowl halftime shows, Eileen Gu, Jutta Leerdam.

About WikiRank

The WikiRank project is intended for automatic relative evaluation of the articles in the various language versions of Wikipedia. At the moment the service allows to compare over 44 million Wikipedia articles in 55 languages. Quality scores of articles are based on Wikipedia dumps from July, 2025. When calculating current popularity and AI of articles data from June 2025 was taken into account. For historical values of popularity and AI WikiRank used data from 2001 to 2025... More information